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Foreword 



rd , 



This Technical Specification (TS) has been produced by the 3 Generation Partnership Project (3GPP). 

The contents of the present document are subject to continuing work within the TSG and may change following formal 
TSG approval. Should the TSG modify the contents of the present document, it will be re-released by the TSG with an 
identifying change of release date and an increase in version number as follows: 

Version x.y.z 

where: 

X the first digit: 

1 presented to TSG for information; 

2 presented to TSG for approval; 

3 or greater indicates TSG approved document under change control. 

y the second digit is incremented for all changes of substance, i.e. technical enhancements, corrections, 
updates, etc. 

z the third digit is incremented when editorial only changes have been incorporated in the document. 
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Scope 



The present document contains an electronic copy of the ANSI-C code for the Floating-point Adaptive Multi-Rate 
Wideband codec. This floating-point codec specification is mainly targeted to be used in multimedia applications or in 
packet-based applications. The bit-exact fixed-point ANSI-C code in 3GPP TS 26.173 remains the preferred 
implementation for all applications, but the floating-point codec may be used instead of the fixed-point codec when the 
implementation platform is better suited for a floating-point implementation. It has been verified that the fixed-point 
and floating-point codecs interoperate with each other without any artifacts. 

The floating-point ANSI-C code in the present document is the only standard conforming non-bit-exact implementation 
of the Adaptive Multi-Rate Wideband speech transcoder (3GPP TS 26.190 [2]), Voice Activity Detection 
(3GPP TS 26.194 [6]), comfort noise generation (3GPP TS 26.192 [4]), and source controlled rate operation 
(3GPP TS 26.193 [5]). The floating-point code also contains example solutions for substituting and muting of lost 
frames (3GPP TS 26.191 [3]). 

The fixed-point specification in 26.173 shall remain the only allowed implementation for the 3G AMR-WB speech 
service and the use of the floating-point codec is strictly limited to other services. 

The floating-point encoder in the present document is a non-bit-exact implementation of the fixed-point encoder 
producing quality indistinguishable from that of the fixed-point encoder. The decoder in the present document is 
functionally a bit-exact implementation of the fixed-point decoder, but the code has been optimized for speed and the 
standard fixed-point libraries are not used as such. 



References 



The following documents contain provisions which, through reference in this text, constitute provisions of the present 
document. 

• References are either specific (identified by date of publication, edition number, version number, etc.) or 
non-specific. 

• For a specific reference, subsequent revisions do not apply. 

• For a non-specific reference, the latest version applies. In the case of a reference to a 3GPP document (including 
a GSM document), a non-specific reference implicitly refers to the latest version of that document in the same 
Release as the present document. 

[ 1 ] 3GPP TS 26. 1 74: "AMR speech codec, wideband; Test sequences" . 

[2] 3GPP TS 26.190: "Mandatory Speech Codec speech processing functions AMR Wideband speech 

codec; Transcoding functions". 

[3] 3GPP TS 26.191: "AMR speech codec, wideband; Error concealment of lost frames". 

[4] 3GPP TS 26.192: "Mandatory Speech Codec speech processing functions AMR Wideband Speech 

Codec; Comfort noise aspects". 

[5] 3GPP TS 26.193: "AMR speech codec, wideband; Source controlled rate operation". 

[6] 3GPP TS 26.194: "Mandatory Speech Codec speech processing functions AMR Wideband speech 

codec; Voice Activity Detector (VAD)". 

[7] RFC 3267 'A Real-Time Transport Protocol (RTP) Payload Format and File Storage Format for 

Adaptive Multi-Rate (AMR) and Adaptive Multi-Rate Wideband (AMR-WB) Audio Codecs, June 
2002. 
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Definitions and abbreviations 



3.1 Definitions 

For the purposes of the present document, the terms and definitions given in TS 26.190 [2], TS 26.191 [3], TS 26.192 
[4], TS 26.193 [5] and TS 26.194 [6]. 

3.2 Abbreviations 

For the purposes of the present document, the following abbreviations apply: 

AMR-WB Adaptive Multi-Rate WideBand 

ANSI American National Standards Institute 

GSM Global System for Mobile communications 

I/O Input/Output 

RAM Random Access Memory 

ROM Read Only Memory 



C code structure 



This clause gives an overview of the structure of the bit-exact C code and provides an overview of the contents and 
organization of the C code attached to the present document. 

The C code has been verified on the following systems: 

IBM PC/AT compatible computers with Windows NT40 and Microsoft Visual C-n- v. 6.0 compiler. 

- IBM PC/AT compatible computers with Windows NT40 and Intel C/C-H- v.4.0 compiler. 

ANSI-C was selected as the programming language because portability was desirable. 

4.1 Contents of the C source code 

The C code distribution has all files in the root level. 

The distributed files with suffix "c" contain the source code and the files with suffix "h" are the header files. The ROM 
data is contained in "rom" files with suffix "c". 

Makefiles are provided for the platforms in which the C code has been verified (listed above). Once the software is 
installed, this directory will have a compiled version of encoder and decoder and all the object files. 

4.2 Program execution 

The Adaptive Multi-Rate Wideband codec is implemented in two programs: 

(encoder) speech encoder; 

(decoder) speech decoder. 
The programs should be called like: 

encoder [encoder options] <speech input file> <parameter file>; 

decoder <parameter file> <speech output file>. 

The speech files contain 16-bit linear encoded PCM speech samples and the parameter files contain encoded speech 
data and some additional flags. 
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The encoder and decoder options will be explained by running the applications without input arguments. See the file 
readme.txt for more information on how to run the encoder and decoder programs. 



4.3 Code hierarchy 



Tables 1 and 2 are call graphs that show the functions used in the speech codec, including the functions of VAD, DTX, 
and comfort noise generation. 

Each column represents a call level and each cell a function. The functions contain calls to the functions in rightwards 
neighbouring cells. The time order in the call graphs is from the top downwards as the processing of a frame advances. 
All standard C functions: memcpyO, fwrite(), etc. have been omitted. The initialization of the static RAM (i.e. calling 
the _init functions) is also omitted. 
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Table 1 : Speech encoder call structure 



E_MAIN_encode 


E UTIL decim 12k8 


E_UTIL_down_samp | E_UTIL_interpol 




E UTIL decim 12k8 






E_UTIL_hp50_12k8 




E_UTIL_hp50_12k8 




E_UTIL_f_preemph 




E_DTX_vad 


E_DTX_filter_bank 


E DTX filters 




E_DTX_filter3 




E DTX level calculation 




E_DTX_decision 


E_DTX_noise_estimate_up 
date 


E_DTX_update_cntrl 


E_DTX_hangover_addition 




E_DTX_speech_estimate 






E_DTX_tx_handler 








E DTX reset 


E LPC isf init 




E_M A 1 N_parm_sto re 






E UTIL autocorr 






E_LPC_lag_wind 




E LPC lev dur 








E_LPC_a_isp_conversion 


E_LPC_chebyshev 






E_LPCJ_int_isp_find 


E_LPC_f_isp_a_conversi 
on 


E_LPC_f_isp_pol_get 




E_LPC_isp_isf_conversion 






E_GAIN_clip_isf_test 




E_LPC_a_weight 




E_UTIL_residu 




E_UTIL_deemph 




E_GAIN_lp_decim2 




E_GAIN_open_loop_search 








E_GAIN_olag_median 


E GAIN sort 




E_DTX_pitch_tone_detection 






E_GAIN_open_loop_search 






E_GAIN_olag_median 




E_DTX_pitch_tone_detection 




E_UTIL_residu 




E DTX buffer 








E_DTX_exe 


E DTX frame indices fin 
d 






E_DTX_isf_history_aver 




E_DTX_isf_q 


E_LPC_isf_sub_vq 




E_LPC_isf_noise_d 


E_LPC_f_isf_reorder 


E_DTX_dithering_control 






E_UTIL_random 




E_MAIN_reset 


E_GAIN_clip_init 




E_DTX_reset 




E DTX vad reset 




E_LPC_isf_2s3s_quantise 


E_LPC_stage1_isf_vq 




E_LPC_isf_sub_vq 




E_LPC_stage1 _isf_vq 




E_LPC_isf_sub_vq 




E LPC isf 2s3s decode 


E LPC isf reorder 




E_LPC_isf_2s5s_quantise 


E_LPC_stage1_isf_vq 






E_LPC_isf_sub_vq 




E_LPC_isf_2s5s_decode 


E_L P CJsf _reo rd e r 




E_LPC_isf_isp_conversion 






E_LPC_int_isp_find 


E_LPC_isp_a_conversion 


E_LPC_isp_pol_get 


E_UTIL_l_extract 


E_UTIL_mpy_32_16 


E_UTIL_l_extract 




E_UTIL_mpy_32_16 




E UTIL residu 






E DTX buffer 




E UTIL residu 




E_UTIL_synthesis 




E_LPC_a_weight 




E_UTIL_residu 




E_UTIL_deemph 




E_UTIL_f_preemph 
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E_LPC_a_weight 






E_UTIL_synthesis 




E UTIL residu 




E_LPC_a_weight 




E_UTIL_synthesis 




E_UTIL_deemph 




E_GAIN_closed_loop_search 


E GAIN norm corr 


E UTIL f convolve 




E_GAIN_norm_corr_inter 
polate 






E_GAIN_clip_test 








E_GAIN_adaptive_codebook_ 
excitation 






E_UTIL_convolve 




E_ACELP_xy1_corr 




E_ACELP_codebook_target_u 
pdate 




E_UTIL_convolve 




E_ACELP_xy1_corr 




E_ACELP_codebook_target_u 
pdate 




E_UTIL_f_preemph 




E_GAIN_f_pitch_sharpening 




E_ACELP_xh_corr 




E ACELP 2t 








E_ACELP_4t 


E_ACELP_h_vec_corr1 






E ACELP h vac corr2 




E_ACELP_2pulse_search 




E_ACELP_quant_1p_N1 




E_ACELP_quant_2p_2N1 




E_ACELP_quant_3p_3N1 


E_ACELP_quant_2p_2N1 




E_ACELP_quant_1p_N1 




E_ACELP_quant_4p_4N 


E_ACELP_quant_4p_4N1 


E ACELP quant 2p 2N 
1 


E_ACELP_quant_1p_N1 




E_ACELP_quant_3p_3N1 




E_ACELP_quant_2p_2N1 




E_ACELP_quant_3p_3N1 




E_ACELP_quant_5p_5N 


E_ACELP_quant_3p_3N1 




E_ACELP_quant_2p_2N1 




E ACELP quant 6p 6N 
2 


E_ACELP_quant_5p_5N 




E_ACELP_quant_1p_N1 




E_ACELP_quant_4p_4N 




E_ACELP_quant_2p_2N1 




E_ACELP_quant_3p_3N1 




E_UTIL_preemph 






E_GAIN_pitch_sharpening 




E_ACELP_xy2_corr 




E_ACELP_gains_quantise 


E_UTIL_dot_product12 


E UTIL saturate 31 




E_UTIL_norm_l 




E_UTIL_normalized_inver 
se_sqrt 






E UTIL 1 extract 




E_UTIL_saturate 




E_UTIL_mpy_32_16 




E_UTIL_log2_32 


E_UTIL_norm_l 




E_UTIL_normalized_log2 




E_UTIL_signal_up_scale 






E_UTIL_signal_down_scale 




E_GAIN_clip_pit_test 




E_UTIL_signal_down_scale 








E_GAIN_voice_factor 


E_UTIL_dot_product12 






E UTIL norm 1 




E UTIL norm s 




E_UTIL_norm_s 








E_UTIL_synthesis 




E_UTIL_enc_synthesis 


E_UTIL_synthesis 






E_UTIL_deemph 




E_UTIL_hp50_12k8 
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E UTIL random 


E_UTIL_hp400_12k8 


E_LPC_a_weight 


E_UTIL_synthesis 


E_UTIL_bp_6k_7k 


E_UTIL_bp_6k_7k 
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Table 2: Speech decoder call structure 



D_MAIN_decode 


D DTX rx handler 


D LPC isf noise d 


D LPC isf reorder 




D_DTX_exe 


D_DTX_cn_dithering 


D UTIL random 




D_UTIL_pow2 






D UTIL norm 1 




D_UTIL_random 




D_UTIL_dot_product12 


D UTIL norm 1 




D_UTIL_normalized_inver 
se sqrt 






D_LPC_isf_isp_conversio 
n 








D_LPC_isp_a_conversion 


D_LPC_isp_pol_get 


D UTIL 1 extract 




D_UTIL_mpy_32_16 




D UTIL 1 extract 






D_UTIL_mpy_32_16 




D_UTIL_dec_synthesis 


D_UTIL_synthesis_32 




D_UTIL_deemph_32 


D UTIL saturate 




D_UTIL_hp50_12k8 


D UTIL 1 extract 




D_UTIL_oversamp_1 6k 


D_UTIL_up_samp 


D_UTIL_interpol 


D_UTIL_random 






D_UTIL_signal_down_sca 

le 




D_UTIL_dot_product12 




D_UTIL_normalized_inver 
se_sqrt 




D_UTIL_hp400_12k8 


D UTIL 1 extract 




D_UTIL_norm_l 






D_LPC_isf_extrapolation 


D UTIL norm s 




D_UTIL_l_extract 




D_UTIL_mpy_32 




D_LPC_isf_isp_conversio 
n 




D_LPC_isp_a_conversion 






D_LPC_a_weight 




D_UTIL_synthesis 




D_LPC_a_weight 




D_UTIL_synthesis 




D_UTIL_bp_6k_7k 




D_UTIL_hp_7k 




D_MAIN_reset 


D GAIN init 




D GAIN lag concealmen 
t init 




D_DTX_reset 




D LPC isf 2s3s decode 


D LPC isf reorder 




D_LPC_isf_2s5s_decode 


D_LPC_isf_reorder 




D_LPC_isf_isp_conversio 
n 








D_LPC_int_ispJind 


D_LPC_isp_a_conversion 






D_GAIN_lag_concealmen 

t 


D_GAIN_sort_lag 


D_GAIN_insert_lag 




D_UTIL_random 






D_GAIN_adaptive_codeb 
ook excitation 








D_UTIL_random 






D ACELP decode 2t 








D_ACELP_decode_4t 


D ACELP decode 1p N 
1 






D_ACELP_add_pulse 




D ACELP decode 2p 2 
N1 




D ACELP decode 3p 3 
N1 


D ACELP decode 2p 2 
N1 




D ACELP decode 1p N 
1 




D ACELP decode 4p 4 
N 


D ACELP decode 4p 4 
N1 


D ACELP decode 2p 2 
N1 


D ACELP decode 2p 2 
N1 


D ACELP decode 1p N 
1 
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D AGELP decode 3p 3 
N1 


D AGELP decode 2p 2 
N1 


D AGELP deeode 5p 5 
N 


D AGELP decode 3p 3 
N1 


D AGELP decode 2p 2 
N1 


D AGELP deeode 6p 6 
N_2 


D AGELP decode 5p 5 
N 


D AGELP decode 1p N 
1 


D AGELP decode 4p 4 
N 


D AGELP decode 2p 2 
N1 


D AGELP decode 3p 3 
N1 


D_UTIL_preemph 






D_GAIN_pitch_sharpenin 

g 




D_GAIN_decode 


D_UTIL_dot_produet12 




D_UTIL_normalized_inver 
se sqrt 




D_GAIN_median 




D UTIL 1 extract 




D_UTIL_pow2 




D_UTIL_mpy_32_16 




D_UTIL_log2 


D_UTIL_norm_l 


D_UTIL_normalized_log2 


D_UTIL_signal_up_scale 


D UTIL saturate 




D UTIL signal down sea 
le 






D_G Al N_f i nd_voice_f acto 
r 


D_UTIL_dot_product12 




D UTIL norm 1 




D_UTIL_norm_s 




D UTIL norm s 






D_UTIL_l_extract 




D_ACELP_phase_disper 




D_UTIL_mpy_32_16 




D UTIL 1 extract 




D_GAIN_adaptive_control 


D_UTIL_norm_l 




D_UTIL_inverse_sqrt 




D_UTIL_dec_synthesis 


D UTIL saturate 




D UTIL signal down sea 
le 






D_DTX_aeti vity_u pdate 


D_UTIL_log2 





4.4 



Variables, constants and tables 



The data types of variables and tables used in the floating-point implementation are signed integers in 2's complement 
representation, defined by: 

Words 8 bit variable 
UWordS 8 bit unsigned variable 

Wordl6 16 bit variable 
Wordl6 16 bit unsigned variable 
Word32 32 bit variable 

Floating-point numbers use the IEEE (Institute of Electrical and Electronics Engineers) format: 

Float32 8 bit exponent, 23 bit mantissa, 1 bit sign 
Float64 1 1 bit exponent, 52 bit mantissa, 1 bit sign 
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4.4.1 Description of fixed tables used in tine C-code 

This clause contains a listing of all fixed tables declared in enc_rom.c and dec_rom.c files. 

Table 3: Encoder fixed tables 



Format 


Table name 


Size 


Description 


Word 16 


E ROM cdown unusable 


7 


Attenuation factors for codebook gain in lost frames 


Word 16 


E ROM cdown usable 


7 


Attenuation factors for codebook gain in bad frames 


Float32. 


E ROM corrweight 


199 


Weighting of the correlation function in open loop LIP search 


Word 16 


E ROM cos 


129 


Table of cos(x) 


Float32 


E ROM dicol isf 


9*256 


1st ISF quantizer of the 1st stage 


Float32 


E ROM dicol isf noise 


2*64 


1 St ISF quantizer for comfort noise 


Float32 


E ROM dico21 isf 


3*64 


1st ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico21 isf 36b 


5*128 


1st ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Float32 


E ROM dico22 isf 


3*128 


2nd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico22 isf 36b 


4*128 


2nd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Float32 


E ROM dico23 isf 


3*128 


3rd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico23 isf 36b 


7*64 


3rd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Float32 


E ROM dico24 isf 


3*32 


4th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico25 isf 


4*32 


5th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Float32 


E ROM dico2 isf 


7*256 


2nd ISF quantizer of the 1st stage 


Float32 


E ROM dico2 isf noise 


3*64 


2nd ISF quantizer for comfort noise 


Float32 


E ROM dico3 isf noise 


3*64 


3rd LSF quantizer for comfort noise 


Float32 


E ROM dico4 isf noise 


4*32 


4th LSF quantizer for comfort noise 


Float32 


E ROM dico5 isf noise 


4*32 


5th LSF quantizer for comfort noise 


Float32 


E ROM en adjust 


9 


Energy scaling factor for each mode during comfort noise 


Float32 


E_ROM_f_interpol_frac 


4 


LPC interpolation coefficients 


Float32 


E ROM fir 6k 7k 


31 


Bandpass FIR filter coefficients for higher band generation 


Word 16 


E ROM fir down 


120 


Downsample FIR filter coefficients 


Float32 


E_ROM_fir_ipol 


61 


Interpol FIR filter coefficients 


Word 16 


E_ROM_fir_up 


120 


Upsample FIR filter coefficients 


Float32 


E ROM grid 


101 


Chebyshev polynomial grid points 


Float32 


E_ROM_hamming_cos 


384 


LP analysis window 


Float32 


E ROM hp gain 


16 


High band gain table for 23.85 kbit/s mode 


Float32 


E ROM inter4 1 


4*2*4 


Interpolation filter coefficients 


Word 16 


E ROM inter4 2 


4*2*16 


Interpolation filter coefficients 


Word 16 


E ROM Interpol frac 


4 


Interpolation filter coefficients 


Float32 


E ROM isf 


16 


ISF table for initialization 


Word 16 


E ROM isp 


16 


ISP table for initialization 


Word 16 


E_ROM_isqrt 


49 


Table used in inverse square root computation 


Float32 


E_ROM_lag_window 


16 


Lag window table 


Word 16 


E ROM log2 


33 


Table used in logarithm computation 


Float32 


E ROM f mean isf 


16 


ISF mean 


Word 16 


E ROM mean isf 


16 


ISF mean 


Float32 


E ROM mean isf noise 


16 


ISF mean for comfort noise 


Word 16 


E ROM pdown unusable 


7 


Attenuation factors for adaptive codebook gain in lost frames 


Word 16 


E_ROM_pdown_usable 


7 


Attenuation factors for adaptive codebook gain in bad frames 


Word 16 


E_ROM_pow2 


33 


Table used in power of two computation 


Float32 


E ROM qua gainGb 


2*64 


Gain quantization table for 6-bit gain quantization 


Float32 


E_R0M_qua_gain7b 


2*128 


Gain quantization table for 7-bit gain quantization 


UwordS 


E_ROM_tipos 


36 


Starting point for codebook search 
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Table 4: Decoder fixed tables 



Format 


Table name 


Size 


Description 


Word 16 


D ROM cdown unusable 


7 


Attenuation factors for codebook gain in lost frames 


Wordie 


D ROM cdown usable 


7 


Attenuation factors for codebook gain in bad frames 


Word 16 


D ROM cos 


129 


Table of cos(x) 


Wordie 


D ROM died isf 


9*256 


1st ISF quantizer of the 1st stage 


Word 16 


D ROM dicol isf noise 


2*64 


1 St ISF quantizer for comfort noise 


Wordie 


D ROM dico21 isf 


3*64 


1st ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Wordie 


D ROM dico21 isf 36b 


5*128 


1 St ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Wordie 


D ROM dico22 isf 


3*128 


2nd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Wordie 


D ROM dico22 isf 36b 


4*128 


2nd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Wordie 


D ROM dico23 isf 


3*128 


3rd ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Wordie 


D ROM dico23 isf 36b 


7*64 


3rd ISF quantizer of the 2nd stage (the 6.60 kbit/s mode) 


Wordie 


D ROM dico24 isf 


3*32 


4th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Wordie 


D ROM dico25 isf 


5*32 


5th ISF quantizer of the 2nd stage (not the 6.60 kbit/s mode) 


Wordie 


D ROM dico2 isf 


7*256 


2nd ISF quantizer of the 1st stage 


Wordie 


D ROM dico2 isf noise 


3*64 


2nd ISF quantizer for comfort noise 


Wordie 


D ROM dico3 isf noise 


3*64 


3rd LSF quantizer for comfort noise 


Wordie 


D ROM dico4 isf noise 


4*32 


4th LSF quantizer for comfort noise 


Wordie 


D ROM dico5 isf noise 


4*32 


5th LSF quantizer for comfort noise 


Wordie 


D ROM fir 6k 7k 


31 


Bandpass FIR filter coefficients for higher band generation 


Wordie 


D ROM fir 7k 


31 


Bandpass FIR filter coefficients for higher band in 23.85 kbit/s mode 


Wordie 


D ROM fir down 


120 


Downsample FIR filter coefficients 


Wordie 


D ROM fir up 


120 


Upsample FIR filter coefficients 


Wordie 


D ROM hp gain 


16 


High band gain table for 23.85 kbit/s mode 


Wordie 


D ROM inter4 2 


4*2*16 


Interpolation filter coefficients 


Wordie 


D ROM interpol frac 


4 


LPG interpolation coefficients 


Wordie 


D ROM isf 


16 


ISF table for initialization 


Wordie 


D ROM isp 


16 


ISP table for initialization 


Wordie 


D ROM isqrt 


49 


Table used in inverse square root computation 


Wordie 


D ROM log2 


33 


Table used in logarithm computation 


Wordie 


D ROM mean isf 


16 


ISF mean 


Wordie 


D ROM mean isf noise 


16 


ISF mean for comfort noise 


Wordie 


D ROM pdown unusable 


7 


Attenuation factors for adaptive codebook gain in lost frames 


Wordie 


D ROM pdown usable 


7 


Attenuation factors for adaptive codebook gain in bad frames 


Wordie 


D ROM ph imp low 


64 


Phase dispersion impulse response 


Wordie 


D ROM ph imp mid 


64 


Phase dispersion impulse response 


Wordie 


D ROM pow2 


33 


Table used in power of two computation 


Wordie 


D ROM qua gain6b 


2*64 


Gain quantization table for e-bit gain quantization 


Wordie 


D ROM qua gain7b 


2*128 


Gain quantization table for 7-bit gain quantization 



4.4.2 Static variables used in tine C-code 

In this clause two tables that specify the static variables for the speech encoder and decoder respectively are shown. All 
static variables are declared within a C struct. 
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Table 5: Speech encoder static variables 



Struct name 


Variable 


Type 


Length 


Description 


Coder_State 


mem_speech 


Float32 


384 


speech buffer 




mem_wsp 


Float32 


371 


buffer holding spectral weighted speech 




mem_hp_wsp 


Float32 


243 


highpass wsp 




mem_decim 


Float32 


30 


Open-loop LTP decimation filter memory 
Estimated BP filter memory (23.85 kbit/s 




mem_hf 


Float32 


30 


mode) 
Input BP filter memory (23.85 kbit/s 




mem_hf2 


Float32 


30 


mode) 
Input LP filter memory (23.85 kbit/s 




mem_hf3 


Float32 


30 


mode) 




memjsp 


Float32 


16 


Old ISP vector 




mem_syn 


Float32 


16 


synthesis filter memory 




mem_syn2 


Float32 


16 


modified synthesis memory 




mem_syn_hf 


Float32 


16 


Higher band synthesis filter memory 




memjsf 


Float32 


16 


Old ISF vector 




mem_hf_wsp 


Float32 


9 


Open-loop lag gain filter memory 




mem_sig_in 


Float32 


4 


Prefilter memory 




mem_sig_out 


Float32 


4 


HP filter memory in the synthesis 




mem_hp400 


Float32 


4 


HP filter memory 




mem_decim2 


Float32 


3 


Open-loop LTP decimation filter memory 




mem_gp_clip 


Float32 


2 


Memory of pitch clipping 




mem_preemph 


Float32 


1 


Preemphasis filter memory 




mem_deemph 


Float32 


1 


Deemphasis filter memory 
Open-loop LTP deemphasis filter 




mem_wsp_df 


Float32 


1 


memory 
Weighting filter memory (applied to error 




mem_wO 


Float32 


1 


signal) 




mem_ol_gain 


Float32 


1 


Open-loop gain 
Weighting level depeding on open loop 




mem_ada_w 


Float32 


1 


pitch gain 




mem_gc_threshold 


Float32 


1 


Noise enhancer threshold 
Higher band gain weighting factor (23.85 




mem_gain_alpha 


Float32 


1 


kbit/s mode) 




mem_ol_lag 


Word32 


5 


Open loop lag history 




mem_TO_med 


Word32 


1 


Weighted open loop pitch lag 




mem_exc 


Word 16 


505 


Excitation vector 




mem_isp_q 


Word 16 


16 


Old ISP vector 




mem_isf_q 


Word 16 


16 


Past quantized ISF prediction error 




mem_gain_q 


Word 16 


4 


Gain quantization memory 




mem_subfr_q 


Word 16 


4 


Scaling factor history 




mem_tilt_code 


Word 16 


1 


Preemhasis filter memory 




mem_q 


Word 16 


1 


Old scaling factor 




mem seed 


Word 16 


1 


Random generation seed 




*vadSt 


E DTX Vad State 


1 


See below in this table 




*dtx_encSt 


E DTX State 


1 


See below in this table 




mem_first_frame 


UWordS 


1 


First frame indicator 




mem_ol_wght_flg 


UWordS 


1 


Switches lag weighting on and off 




mem vad hist 


UWordS 


1 


VAD history 


E_DTX_State 


memjsf 


Float32 


128 


LSP history 




mem_distance 


Float32 


28 


ISF history distance matrix 




mem_distance_sum 


Float32 


8 


Sum of ISF history distances 




mem_log_en 


Float32 


8 


Logarithmic frame energy history 




mem_hist_ptr 


Word 16 


1 


Pointer to the cyclic history vectors 




mem_log_en_index 


Word 16 


1 


Index for logarithmic energy 




mem_cng_seed 


Word 16 


1 


Comfort noise excitation seed 




mem_dtx_hangover_count 


Word 16 


1 


DTX hangover period 
Counter for elapsed speech frames in 




mem dec ana elapsed count 


Word 16 


1 


DTX 


E_DTX_Vad_State 


mem_pow_sum 


Float64 


1 


Power of previous frame 




mem_bckr_est 


Float32 


12 


Background noise estimate 
Averaged input components for 




mem_ave_level 


Float32 


12 


stationary estimation 




memjeve 


Float32 


12 


Input levels of the previous frame 
Input levels calculated at the end of a 




mem sub level 


Float32 


12 


frame (lookahead) 
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Struct name 


Variable 


Type 


Length 


Description 




mem_a_data5 


Float32 


10 


Memory for the filter bank 




mem_a_data3 


Float32 


6 


Memory for the filter bank 




mem_sp_max 


Float32 




Maximum level 




mem_speech_level 


Float32 




Estimated speech level 




mem_burst_count 


Word 16 




Counts length of a speech burst 




mem_hang_count 


Word 16 




Hangover counter 




mem_stat_count 


Word 16 




Stationary counter 




mem_vadreg 


Word 16 




Flags for intermediate VAD decisions 




mem_pitch_tone 


Word 16 




Flags for pitch and tone detection 




mem_sp_est_cnt 


Word 16 




Counter for speech level estimation 




mem sp max cnt 


Word 16 




Counts frames that contains speech 
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Table 6: Speech decoder static variables 



Struct name 


Variable 


Type 


Length 


Description 


DecoderState 


mem_gc_thres 


Word32 


1 


Threshold for noise enhancer 




mem_exc 


Word 16 


505 


INTERPOL]; /* old excitation vector 




mem_isf_buf 


Word 16 


4S 


ISF buffer{frequency domain) 




mem_hf 


Word 16 


30 


HF band-pass filter memory 




mem_hf2 


Word 16 


30 


HF band-pass filter memory 




mem_hf3 


Word 16 


30 


HF band-pass filter memory 




mem_oversamp 


Word 16 


24 


Synthesis oversampled filter memory 




mem_gain 


Word 16 


23 


Gain decoder memory 




mem_syn_hf 


Word 16 


20 


HF synthesis memory 




memjsp 


Word 16 


16 


Old ISP (immittance spectral pairs) 




memjsf 


Word 16 


16 


Old ISF (frequency domain) 




mem_isf_q 


Word 16 


16 


Past ISF quantizer 




mem_syn_hi 


Word 16 


16 


Modified synthesis memory (MSB) 




mem_syn_lo 


Word 16 


16 


Modified synthesis memory (LSB) 




mem_ph_disp 


Word 16 


8 


Phase dispersion memory 




mem_sig_out 


Word 16 


6 


Hp50 filter memory for synthesis 




mem_hp400 


Word 16 


6 


Hp400 filter memory for synthesis 




memjag 


Word 16 


5 


LTP lag history 




mem_subfr_q 


Word 16 


4 


Old maximum scaling factor 




mem_tilt_code 


Word 16 




Tilt of code 




mem_q 


Word 16 




Old scaling factor 




mem_deemph 


Word 16 




Speech deemph filter memory 




mem_seecl 


Word 16 




Random memory for frame erasure 




mem_seed2 


Word 16 




Random memory for HF generation 




mem_seed3 


Word 16 




Random memory for lag concealment 




mem_TO 


Word 16 




Old pitch lag 




mem_TO_frac 


Word 16 




Old pitch fraction lag 




mem vad hist 


UWordie 




VAD history 




dtx_decSt 


D DTX State 




See below in this table 




mem_bfi 


UWordS 




Previous BFI 




mem_state 


UWordS 




BGH state machine memory 




mem first frame 


UWordS 




First frame indicator 


dtxdecState 


mem_isf_buf 


Word 16 


128 


ISF vector history (8 frames) 




memjsf 


Word 16 


16 


ISF vector 




mem_isf_prev 


Word 16 


16 


Previous ISF vector 




mem_log_en_buf 


Word 16 


8 


Logarithmic frame energy history 




mem_true_sid_period_inv 


Word 16 




Inverse of true SID update rate 




mem_log_en 


Word 16 




Logarithmic frame energy 




mem_log_en_prev 


Word 16 




Previous logarithmic frame energy 




mem_cng_seed 


Word 16 




Comfort noise excitation seed 




mem_hist_ptr 


Word 16 




Index to beginning of LSF history 




mem_dither_seed 


Word 16 




Comfort noise dithering seed 




mem_cn_dith 


Word 16 




Background noise stationarity information 




mem_since_last_sid 


Word 16 




Number of frames since last SID frame 




mem_dec_ana_elapsed_count 


UWordS 




Counts elapsed speech frames after DTX 




mem_dtx_global_state 


UWordS 




DTX state flags 




mem_data_updated 


UWordS 




Flags CNI updates 




mem_dtx_hangover_count 


UWordS 




Counts down in hangover period 




mem_sid_frame 


UWordS 




Flags SID frames 




mem_valid_data 


UWordS 




Flags SID frames containing valid data 




mem dtx hangover added 


UWordS 




Flags hangover period at end of speech 



Homing procedure 



The principles of the homing procedures are described in [2]. The present document only includes a description of the ' 
decoder homing frames. For each AMR-WB codec mode, the corresponding decoder homing frame has a fixed set of 
speech parameters. Table 7 shows the homing frame speech parameters for different modes. 
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Table 7: Table values for the decoder homing frame parameters for different modes 



Mode 



Speech Parameters 



0, 49, 131, 84, 5, 50, 29, 2015, 8,0, 2061, 8,1, 3560, 8,0, 2981, 8 

0, 49, 131,55, 49, 38,26, 29, 29,3, 15, 7,15, 8, 16,13, 7, 17,16, 8, 0,16, 20, 16,27, 8, 23,0, 27, 0,27, 8 

0,49, 131,55,49,38,26,29,58, 1,7,63,127, 15,70,37, 1,209,210,224,96,31,7, 1,256,260,271, 

443, 31 , 47, 0, 400, 238, 436, 347, 31 

0, 49, 131, 55, 49, 38, 26, 29, 58, 1, 3847, 3845, 63, 127, 70, 34, 0, 3128, 4517, 192, 96, 0, 2, 1, 4160, 

8036, 267, 443, 31 , 46, 0, 3840, 7091 , 432, 395, 31 

0, 49, 131, 55, 49, 38, 26, 29, 58, 1, 3847, 3845, 3847, 3843, 70, 31, 0, 3648, 4764, 824, 2864, 0, 6, 1, 

41 60, 5220, 431 9,7131,31, 47, 0,112, 3764, 219,211,31 

0, 49, 131, 55, 49, 38, 26, 29, 58, 1, 3, 2, 3, 2, 7223, 703, 7223, 703, 70, 0, 1, 3, 2, 2, 3, 9475, 9483, 3090, 

8737, 0, 0, 1 , 0, 0,2,0,411 2, 4400, 841 5, 1 4047, 31 , 38, 0, 2, 1 , 3, 1 , 91 , 426, 1 3545, 1 2955, 

0, 49, 131, 55, 49, 38, 26, 29, 58, 1, 161, 759, 3, 2, 127, 516, 6167, 447, 70, 11, 1, 264, 641, 2, 3, 123, 562, 

8347, 4354, 0, 1, 1, 264, 408, 3, 0, 256, 308, 9487, 14047, 31, 46, 0, 320, 885, 2, 2, 464, 439, 11347, 

12739,0 

0, 49, 1 31 , 55, 49, 38, 26, 29, 58, 1 , 1 1 54, 1 729, 1 1 54, 1 761 , 447, 1 51 9, 959, 495, 70, 27, 1 , 1 800, 1 253, 

665, 1 960, 546, 1 64, 1 043, 335, 0, 28, 1 , 580, 1 96, 1 1 87, 383, 1 031 , 1 052, 359, 1 531 , 31 , 45, 1 , 1 024, 893, 

1 272, 1 920, 101, 876, 203, 1119,31 

0, 49, 1 31 , 55, 49, 38, 26, 29, 58, 1 , 1 729, 1 1 54, 1 761 , 1 1 54, 1519, 959, 495, 447, 70, 3, 42, 1 , 580, 1 436, 

1362, 1250, 901, 714, 24, 45, 0, 0, 0, 1, 68, 708, 1212, 383, 1048, 1611, 1756, 1467, 31, 1, 23, 0, 1536, 

1460, 861, 1554, 410, 1368, 1008, 594, 31, 



6 File formats 

This clause describes the file formats used by the encoder and decoder programs. The test sequences defined in [1 also 
use the file formats described here. 

6.1 Speech file (encoder input/decoder output) 

Speech files read by the encoder and written by the decoder consist of 16-bit words where each word contains a 14-bit, 
left aligned speech sample. The byte order depends on the host architecture (e.g. MSByte first on SUN workstations, 
LSByte first on PCs etc.). Both the encoder and the decoder program process complete frames (of 320 samples) only. 

This means that the encoder will only process n frames if the length of the input file is n'*320 + k words, while the files 
produced by the decoder will always have a length of n* 3 20 words. 

6.2 Mode control file (encoder input) 

The encoder program can optionally read in a mode control file which specifies the encoding mode for each frame of 
speech processed. The file is a text file containing one number per speech frame. Each line contains one of the mode 
numbers 0-8. 

6.3 Parameter bitstream file (encoder output/decoder input) 

The files produced by the speech encoder/expected by the speech decoder are described in RFC 3267 [7], sections 5.1 
and 5.3. 

By using a preprocessor definition encoder/decoder can optionally use format described in TS26.201 that defines an 
octet-aligned frame format (Interface format 2) for the AMR-WB codec. 
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Annex A (informative): 
Change history 



Change history 


Date 


TSGff 


TSG Doc. 


CR 


Rev 


Subject/Comment 


Old 


New 


2002-03 


15 


SP-020073 






Presented at TSG SA#1 5 for approval 


2.0.0 


5.0.0 


2003-03 


19 


SP-030090 


001 


1 


Correction to log(O) error in VAD decision witli low SNR 
input signals 


5.0.0 


5.1.0 


2003-03 


19 


SP-030090 


002 


1 


Correction to decoder with input of long sequence of 
NO_DATA frames 


5.0.0 


5.1.0 


2003-03 


19 


SP-030090 


003 


1 


Correction to "D_UTIL_pow2" function to be bitexact 
with TS26.173 counterpart 


5.0.0 


5.1.0 


2003-03 


19 


SP-030090 


004 


1 


MMS compatible i/o format option 


5.0.0 


5.1.0 


2003-03 


19 


SP-030090 


005 




Correction for handling of RX_NO_DATA frames 


5.0.0 


5.1.0 


2003-03 


19 


SP-030090 


006 


1 


Ambiguous expressions in the AMR-WB Floating-point 
C-Code 


5.0.0 


5.1.0 


2003-09 


21 


SP-030447 


008 




Possible decoder LPC coefficients overflow 


5.1.0 


5.2.0 


2004-12 


26 


SP-040844 


009 


1 


Incorrect definition of vector nb of bits 


5.2.0 


6.0.0 


2007-03 


35 


SP-070029 


0011 




IVIaintaining bit-exactness with TS 26.173 after 
Correction in AIVIR decoder to avoid division by zero in 
RX-DTX Handling 


6.0.0 


7.0.0 


2007-03 


35 


SP-070029 


0012 




Bug fix to SID frame signaling in decoder 


6.0.0 


7.0.0 


2007-09 


37 


SP-070626 


0015 


1 


Robust operation of AMRWB-decoder 


7.0.0 


7.1.0 


2008-12 


42 








Version for Release 8 


7.1.0 


8.0.0 


2009-12 


46 








Version for Release 9 


8.0.0 


9.0.0 


2011-03 


51 








Version for Release 10 


9.0.0 


10.0.0 
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